The Failure of the Ergodic Assumption 
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The well established procedure of constructing phenomenological ensemble from a single long time 
series is investigated. It is determined that a time series generated by a simple Uhlenbeck-Ornstein 
Langevin equation is mean ergodic. However the probability ensemble average yields a variance that 
is different from that determined using the phenomenological ensemble (time average). We conclude 
that the latter ensemble is often neither stationary nor ergodic and consequently the probability 
ensemble averages can misrepresent the underlying dynamic process. 



o 

(N 
C 

(N 
(N 



c$ 
-f— > 
ctf ■ 

: 

1*3 1 

o ■ 

C/3 ■ 
>Y 

^ : 



> 
o 



o 
o 



PACS numbers: 

The ergodic hypothesis, which states the equivalence 
of time averages and phase space averages, began with 
Boltzmann who conjectured that a single trajectory 
can densely cover a surface of constant energy in phase 
space. This 'proof 'of the hypothesis as well as many 
subsequent proofs were subsequently shown to be fatally 
flawed. It was not until metric decomposability was in- 
troduced by Birkoff [2| that a rigorous mathematical the- 
ory of ergodicity began to take shape. Kinchin Q, who 
wrote a seminal work on the mathematical foundations 
of statistical mechanics, offered a second meaning for the 
ergodic hypothesis, that being, to assume the truth of the 
hypothesis and judge the theory constructed on this basis 
by its practical success or failure. This latter perspective 
is the one adopted by the vast majority of physicists, 
with the subsequent replacement of phase space averages 
with averages over an ensemble probability density. We 
call this latter use of ergodicity the 'ergodic assumption'. 

The ergodic assumption was quite evident in quantum 
measurements, which historically employed single parti- 
cle ensembles to evaluate averages. However the advances 
in instrumentation and measurement techniques over the 
past decade or so enabled the measurement of single 
molecule time series and these relatively recent experi- 
ments forced the ergodic assumption out of the shadows 
into the foreground. Margolin and Barkai Q have argued 
that a wide variety of complex phenomena whose dynam- 
ics consist of random switching between two states, such 
as the fluorescent intermittency of single molecules [|[ 
and nanocrystals @ may display non-ergodicity, which is 
to say the ergodic assumption breaks down. The break 
down occurs because the probability density for sojourn- 
ing within one of the two states is given by the inverse 
power law ip (t) oc so that when /i < 2 the average 
sojourn time within a state diverges and the time series 
is manifestly non-ergodic . A similar breakdown is ob- 
seved in neuroscience where the cognitive activity asso- 
ciated with perceptual judgement is determined to span 
the interval 1 < /i < 3 [8j and consequently to manifest 
non-ergoic behavior in a variety of experiments [t| [1(3] • 

In this Letter we show that the ergodic assumption is 
even more delicate than determined in these experiments, 
which is to say it is less applicable to complex networks 
in general, or physical networks in particular, than was 
previously believed. To establish the extent of the failure 



of the ergodic assumption we begin with a review of its 
introduction into the physics literature. 

A stochastic physical process is denoted by a dynamic 
variable X(t) that can be the velocity of a particle under- 
going Brownian motion, the erratic voltage measured in 
an electroencephalogram or the time series from any of 
a vast number of other complex physical or physiological 
phenomena. In any event in the absence of direct evi- 
dence to the contrary it is assumed that the underlying 
process is stationary in time implying that this observable 
can be determined experimentally from a single histori- 
cal record X(t) measured over a sufficiently long time. As 
stated in a classic paper by Wang and Uhlenbeck (WU) 
in 1945 [TO]: 

..One can then cut the record in pieces of 
length T (where T is long compared to all 
"periods" occurring in the process), and one 
may consider the different pieces as the differ- 
ent records of an ensemble of observations. In 
computing average values one has in general 
to distinguish between an ensemble average 
and a time average. However, for a station- 
ary process these two ways of averaging will 
always give the same result... 

This technique for generating empirical realizations of 
physical ensembles subsequently proved to be very useful 
and has been the form of the ergodic assumption adopted 
to determine average quantities from time series for three 
generations of scientists. The stationarity assumption 
implies that single variable distributions are independent 
of time, so that their ensemble averages and time aver- 
ages are the same. Moreover, the joint distribution func- 
tions for pairs of dynamic variables depends only on 
the difference in time between the measurements of the 
two variables. Formally, this implies that the difference 
variable Z(t,r) = X(t + r) — X(t) is only a function of 
the time difference r, that is, Z(t,r) — Z(r). However 
a stationary time series need not have stationary incre- 
ments and the difference variable can depend on t and r 
separately. Since the introduction of phenomenological 
ensembles over a half century ago the phenomena of in- 
terest have become more complex and consequently the 
WU approach to evaluating averages is often not ade- 
quate. To establish this inadequacy we go back to the 
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prequel of WU, another classic, and consider a process 
first analyzed in detail by Uhlenbeck and Ornstein (UO) 
[H| in 1930. 

Let us consider the UO stochastic process about which 
we know everything so as to test the ergodic assumption. 
Consider the UO Langevin equation 



dX{t) 
dt 



= -XX(t) + F(t) 



where in a physics context X(t) is the one-dimensional 
velocity, the particle mass is set to one, A is the dissi- 
pation parameter and F(t) is a random force. In other 
contexts the dynamic variable has been taken to be the 
voltage measured in an EEG record [l3T | , or any of a num- 
ber of other interesting observables. The solution to the 
UO Langevin equation, as we show below, actually vi- 
olates the stationarity assumption made by Wang and 
Uhlenbeck in 1945. To establish this violation we calcu- 
late the variance using the phenomenological ensemble 
and find that it is different from that obtained using the 
multi-trajectory ensemble (MTE), that is, the ensemble 
obtained by solving the stochastic differential equation 
([T]) using an ensemble of realizations of the random force. 
The significance of this result can not be overestimated 
since it directly contradicts a half century of analyses 
made assuming the ergodicity of the phenomenological 
ensembles constructed from the time series. 

The formal solution of the OU Langevin equation is, 
with the initial condition X(0), 



X(t) 



,-At 



X(0) + / F{t')e M dt' 



(2) 



A proper interpretation of this solution requires the spec- 
ification of the statistics of the random force. If we as- 
sume the random force is a zero-centered delta correlated 
Gaussian process, as is often done, then the solution fl3J 
is generally accepted as a stationary ergodic process. 

Recall that an ergodic process is one for which the time 
average of an analytic function of the dynamic variable 



If g{x) = x, then ([3]) is a generalization of the law of 
large numbers and in the limit of long time lags the 
time average of the dynamic variable exists in proba- 
bility flij . The process in this case is mean ergodic, 
or first-order ergodic, which requires that the autocor- 
relation function have the appropriate asymptotic be- 
havior, that being, for Cx{r) = (X(t)X(t + t)) t we 
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case ((SJ) is true [15j. If we consider the quadratic vari- 
able Z(t) = X(t)X(t + t) then Z(t) is mean ergodic if 

T 

■fp J Cz{r)dT — > as T — > oo, which implies that X{t) is 
o 

second-order ergodic. The question remains whether the 
phenomenological ensemble averages constructed from a 
single historical trajectory of long but finite length T 
satisfies ([5]) for any g{x) other than linear as explicitly 
assumed in WU. It is probably obvious but it should be 
said that first-order ergodicity is important in statistical 
physics because it guarantees that the microcanonical en- 
semble of Gibbs produces the correct statistical average 
in phase space for Hamiltonian systems flij ]. 

To answer the question regarding the validity of (JS|) for 
quantities higher than first-order consider the solution 
to the UO Langevin equation, which since the dynamic 
equation is linear has the same statistics as the random 
force. Assuming the random force is a zero-centered delta 
correlated Gaussian process, we know that the solution 
([2]) is completely determined by the mean and variance. 
The solution with X(0) — yields the average values 
for the dynamic variable (X) T = (X) MTE = 0. The 
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autocorrelation function is Cx(t) = ff-e~ Ar so that the 

T 

condition J Cx(i~)dT — > as T — > oo is satisfied and 
o 

consequently the solution to the UO Lagevin equation is 
first-order ergodic. 

Now let us consider the variance using the MTE av- 
erages consisting of an infinite number of trajectories all 
starting from X(0) = (without loss of generality) yield- 
ing 



{g{X)) T =Yxm ^ I g(X(t'))dt' CM 



and the MTE average 



(g(x)) 



MTE 



g{x)P(x)dx 



are equal 



(4) 



a 2 (t)^{X 2 (t)) MTE -{X(t)) 



'MTE 



2A 



[1 



,-2Atl 



(6) 

Note that © is also the variance obtained using the 
Gaussian solution to the Fokker-Planck equation for the 
ensemble probability distribution. 

Now let us examine the single trajectory case. To cre- 
ate an ensemble of trajectories consider different portions 
of the single trajectory. Let X(t) (X(0)=0) be a single 
trajectory of maximum finite length T and consider the 
set of trajectories for t € [0, T — r] 



(g(X)) T = (g(X)) MTE 



(5) 



Z(t,r) = X(t + r) - X(t). 



(7) 
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Using ([2]), we can write X (t + r) as 



X(t + r) = e 



-At 



o 



(8) 



where the initial condition for the trajectory at t + r is 
the time series at time i. Inserting ([5]) into 0, we obtain 
for t G [0, T - t] 



Z(t,r) 



,-At 



F(t + t')e xt dt' + X(t) [e- XT -1] 



(9) 

Note that the first term of the rhs of (0 is the solution 
of the OU Langevin equation (JlJ with t=r and X(0)= 0, 
apart from the fact that in the integrand of © only the 
time in the range [i,i+r] are considered for the function 
F (instead of the range (0,r) as in (J2])). We now define 
the solution starting from zero to be 



T 

X (t, t) = e- XT J F(t + t')e xt 'dt' 



(10) 



so that the term X(t) in (0) is equivalent to X o (0, t), and 
therefore with t G [0, T — t] © reduces to 



Z(t,T)=X o (t,T)+X o (0,t) [e- XT - 1] 



(11) 



This equation reveals that the statistics of the variable 
Z(t,r) depends on the particular value of t through the 
second term on the rhs. In fact the term Xo(0,t) has 
zero mean and variance that depends on t according to 
© . Also notice that the first term of the above equation 
depends on t but the "dependence" is a time transla- 
tion of the variable F in the integrand of (fTUj) . But the 
random force F is stationary by assumption and thus 
Xo(t,r) is statistically equivalent to Xq(0,t). The func- 
tion Z(t, t) is the sum of two statistically independent 
Gaussian variables and therefore is itself a Gaussian vari- 
able with (Z(t,r)) T = (Z(t,r)) MTE = and second- 
moment 



(Z 2 (t,r)) 



MTE 



,-At 



l]V(t). (12) 



We use this variance to demonstrate that the phenomeno- 
logical ensemble created is not equivalent to MTE be- 
cause the trajectories Z(t, r) are not statistically equiva- 
lent to those of X(t). We show that u^ tra j, the variance 
calculated with the ensemble Z(t,r), is different from 
that calculated using the MTE ©. 

To calculate the variance cr^ tra j(T) using the phe- 
nomenological ensemble, we simply need to calculate the 
mean value (among the trajectories) using p^|) . Here the 
variance is calculated using the time average since only 
the time series is assumed to be available to us 



T-7 



straj 



(r) 



1 



T-t 



dt(Z 2 (t 7 T)) 



MTE ' 



(13) 
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FIG. 1: The variance calculated for the OU process using the 
single long time series and the phenomenological WU ensem- 
ble. Note the difference in the dissipation rates for the two 
ways of calculating the variance. 



Inserting the mean of the phenomenological second mo- 
ment into the integrand yields 



° straj ( r ) 



T-t 



dt 



{a 2 (r)+[e-^-l]V(*)} 



(14) 

and using (jSJ) allows to write after integrating over time 



cr 2 straj (r) = a 2 (r) + [ e -^-l] 



We' 




.2A_ 





1 e-2A(T-r) _ 1 



T-t 



2A 



(15) 



Since AT^>1 (because T is the length of the record) we 
can write 



a 2 straj (T)^a 2 (T)+[e- x ^-l\ 
and substitution from (|6]) yields 



2A 



(16) 



rl raj (T)* U f(l-a- x n. 



(17) 



The last expression shows that the phenomenological en- 
semble trajectories Z(t, r) behave as a MTE with an ef- 
fective dissipation that is half the dissipation rate in the 
Langevin equation generating the process, that is, 



A. 



eff 



A/2. 



(18) 



Consequently, it takes half as long for the MTE variance 
to decay as it does for the phenomenological variance of 
the time series. 
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Consider what has been established in this Letter. 
First a long time series is generated numerically from the 
U) Langevin equation. This long series is separated into 
a large number of equal length time series as prescribed 
by Wang and Uhlenbeck |ll| to form a phenomenolog- 
ical ensemble of realizations of the UO process. This 
phenomenological ensemble of trajectories is shown to be 
mean ergodic. However when they are used to calculate 
the variance of the process anticipating that the same re- 
sult will be obtained as that for a MTE average, the prob- 
ability ensemble average, of the variance. The expected 
result turns out to be wrong. The dissipation parameter 
determined by the MTE variance is twice that obtained 



using the phenomenological ensemble. The inescapable 
conclusion is that the phenomenological prescription for 
calculating the average of any nonlinear function of the 
dynamic variable using time series is not equivalent to 
that using a MTE. The fact that ([5]) is not satisfied for 
g(x) 7^ x implies that the ergodic assumption is violated 
by any finite time phenomenological ensemble. Given 
the simplicity of the dynamic model discussed here; it is 
mean ergodic; combined with the recent findings concern- 
ing the non-ergodicity of experimental data mentioned 
earlier, it is not unreasonable to conclude that the appli- 
cation of the ergodic assumption in general is probably 
unwarranted. 
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